
In the Specification : 

Please amend the specification as follows: 

Please replace the paragraph beginning on page 1, line 20, with the following 
rewritten paragraph: 

-Hitherto, in a document management system using a network environment, a 
document information search apparatus which can search necessary documents from a large 
amount of document data existing on the Internet or Ethernet and can rapidly refer to it has 
been provided. In a document search in such a case, the user designates one or a plurality of 
proper words or character trains which are considered to be included in the necessary 
documents as keywords and searches the documents including the word(s) of the designated 
keyword from a search database, and a document list is displayed as a search result. In such 
a document information search apparatus, with respect to search target documents existing on 
the network, an index describing a list of important words extracted from their contents is 
formed for every document and stored in the search database. If a search request in which a 
keyword is designated is issued from the user, the index in the search database is searched, 
thereby outputting a search result of the document list. Further, according to the 
conventional document information search apparatus, after the user searched the documents 
which are considered to be necessary from the document list searched by the designation of 
the keyword, if a search of similar documents is designated with respect to the selected 
documents, words in which a frequency of appearance in the search documents is high are 
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automatically extracted and added to previously executed search conditions on the basis of a 
condition of the OR, so that the similar documents can be searched.-- 



Please replace the paragraph beginning on page 5, line 12, with the following 
rewritten paragraph: 

-The search condition designating unit on the search requesting source 
transmits a head file portion of the designated file contents. Since many important keywords 
necessary for document search exist usually in a head portion of a document, only the head 
portion of the file contents, for example, the head portion of 1 kB is transmitted as a search 
condition. Since the document files which are used for the search condition have various 
sizes, by deciding a capacity of the file which is transmitted as a search condition, a 
communication load and the processes on the search side are reduced. The search condition 
designating unit includes an HTML file and an Excel file as files which are designated as a 
search condition. Even in the other file formats, the files include a file of an arbitrary file 
format so long as it is a file from which a text document can be extracted. A database in 
which index information describing a list of important words extracted from the search target 
documents has been stored for every document is provided for the search machine on the 
server side. The document search unit of the search machine comprises: a text extraction 
processing unit which extracts a text document from the file contents received in response to 
the search request; a morpheme analyzing unit which extracts nouns by a morpheme analysis 
of the text document; a keyword forming unit which extracts important words from the nouns 




and forms a keyword in which the important words are coupled by OR; and a search 
executing unit which searches similar documents by searching the search database by the 
keyword and notifies the client of a search result. The keyword forming unit counts the 
number (H) of times of appearance showing in which document^ in the index of each search 
document stored in the search database^ each noun appears and selects a predetermined 
number of upper words each having the number (H) of times of appearance in a 
predetermined range, thereby forming the keyword. When the number of documents in the 
index is assumed to be N, the keyword forming unit selects the upper ten words each having 
the number of times of appearance in a range in which the number (H) of times of 
appearance is equal to, for example, 
2N/3 >H> 1 

and forms the keyword. Thus, the important words which have been registered in the index 
of the database and are necessary for the similar document search of the existing documents 
are narrowed down, thereby raising precision of the similar document search. Further, the 
keyword forming unit allows property information extracted from the received file to be 
included in the keyword and searched in response to the search request. As property 
information in this case, there are a writer of the file received in response to the search 
request, a document title, and the like. By adding the property information of the file into the 
search conditions as mentioned above, for example, in the case where the user wants to 
specify the writer or the like, the similar documents can be properly narrowed down. The 
search condition designating unit of the search requesting source is provided by a WWW 
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browser of the client and the contents of the file designated on a search request picture plane 
of the WWW browser are transmitted to a WWW server through the network and sent to the 
search machine.-- 
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